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Abstract — We consider a multi-pair relay channel where multi- 
ple sources simultaneously communicate with destinations using 
a relay. Each source or destination has only a single antenna, 
while the relay is equipped with a very large antenna array. We 
investigate the power efficiency of this system when maximum 
ratio combining/maximal ratio transmission (MRC/MRT) or 
zero-forcing (ZF) processing is used at the relay. Using a very 
large array, the transmit power of each source or relay (or 
both) can be made inversely proportional to the number of 
relay antennas while maintaining a given quality-of-service. At 
the same time, the achievable sum rate can be increased by a 
factor of the number of source-destination pairs. We show that 
when the number of antennas grows to infinity, the asymptotic 
achievable rates of MRC/MRT and ZF are the same if we scale 
the power at the sources. Depending on the large scale fading 
effect, MRC/MRT can outperform ZF or vice versa if we scale 
the power at the relay. 

I. Introduction 

Multiple-input multiple-output (MIMO) technology has now 
become an integral feature of many advanced communica- 
tion systems. A cellular architecture with MIMO that has 
gained significant research interest is multi-user MIMO (MU- 
MIMO) in which an antenna array simultaneously serves a 
multiplicity of autonomous co-channel users/nodes (TJ. While 
the current systems have limited number of antennas (e.g. 
the LTE standard allows for up to 8 antenna ports), MU 
systems having a large number of antennas at the base station 
(very large MIMO) have been advocated recently in (2]- 
flU. Very large MU-MIMO systems can substantially reduce 
the interference with simple signal processing techniques and 
achieve increased reliability and throughput, and significant 
reduction in total transmitted power (0. 

On the other hand, relaying has been extensively explored 
to provide expanded coverage and high throughput, especially 
at the cell edge J6|. However, inter-user interference can 
cause major performance degradation in MU relay systems 
0. As a result, a large body of performance analysis work 
on MU relay systems, e.g., ll8l- lfT0l has mainly avoided in- 
terference slots by adopting spectrally inefficient policies such 
as orthogonalization of time/frequency. Another line of work 
on MU relay systems has considered deploying complicated 
precoder/decoder designs; e.g., in ifTTI and advanced joint 
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network coding and signal alignment techniques for the multi- 
pair two-way relay channel lfl2l . 

In this paper, we analyze the performance of a multi- 
pair relaying scenario where a group of K sources and K 
destinations communicate using a single relay equipped with 
N antennas, where N 3> K. To the best of our knowledge, 
there is no prior work that analyzes the effects of large antenna 
arrays on the performance of the considered relay system. 
In the first time slot, all K sources simultaneously transmit 
their signals to the relay. In the second time slot, a linearly 
transformed version of the received signal at the relay is 
forwarded to the K destinations. For this multi-pair relay 
channel, we study the achievable rate vs. power efficiency 
performance with (1) maximum ratio combining/maximal ratio 
transmission (MRC/MRT) and (2) zero-forcing (ZF) at the 
relay. 

We show that when N is large, we can cut the transmit 
power at each source or/and the relay proportionally to 1/N 
with no performance degradation. The asymptotic achievable 
rates of MRC/MRT and ZF for N — > oo are derived for cases 
when the transmit power of each source or/and the relay is 
made inversely proportional to N. The results show that when 
the transmit power of each source scales as 1 /N while keeping 
a fixed transmit power at the relay, the fast fading, interference 
from other sources, and noise at the destination disappear 
and hence, the system performance does not depend on the 
quality of the channel in the second hop. In contrast, for the 
case when the transmit power of each source is fixed and the 
transmit power of the relay is scaled down as 1/N, the system 
performance does not depend on the channel quality of the first 
hop. We further show that when the transmit power at the relay 
is cut proportionally to 1/N, with very large N, depending on 
the large-scale fading effect, MRC/MRT performs better than 
ZF or vice versa. 

Notation: f, || • || and Tr(-) denote the conjugate transpose 
operation, Euclidean norm and the trace of a matrix respec- 
tively. E{x} stands for the expectation of a random variable x, 

and Im is the identity matrix of size M. -4' and -4- denote the 
almost sure convergence and the convergence in distribution, 
respectively. 

II. System Model 

Consider a scenario with a group of K sources, Sk, K 
destinations, Dk, for k = 1,...,K, and a single relay, R. 




Fig. 1 . System model. The channel strengths of the source-relay, (S k — R) 
and relay-destination, (R — D k ) links for k = 1, . . . , K are r\\ k and 772^, 
respectively. 



Each source/destination is equipped with a single antenna 
while R is equipped with N antennas as shown in Fig. 1. 
The source Sk wants to communicate with the destination Dk. 
Communication in this network occurs via R since there are 
no direct links among Sk and D k, for any k due to heavy 
shadowing and path loss phenomenon. 

During the first phase all sources simultaneously transmit 
their symbols to R, and the received N x 1 signal vector can 
be written as 



Vr 



r P t Gix + n R , 



(1) 



where \fP~tX = VPl^ii X2, ■ • ■ , xk] T are transmitted symbols 
with i \ xx } = Ik (the average transmitted power of each 
source is P t ) and ur is an N x 1 additive white Gaussian noise 
(AWGN) vector at the relay node with E{riRn R } = cj^Jn- 

The N x K channel matrix between the K sources and R is 

1/2 

expressed as G\ = H\D^ where H\ contains independent 
and identically distributed (i.i.d.) CM (0, 1) entries and D\ is 
a K x K diagonal matrix, where [D{\kk = Vik- Moreover, we 

model the N x K channel matrix between the K destinations 

1/2 

and R as G2 = H2D2 where H2 contains i.i.d. CM (0, 1) 
entries and D2 is a K x K diagonal matrix, where [Z^U-fc = 
rj2k- Note that H 1 and H2 represent independent fast fading, 
while D\ and D2 represent path-loss attenuation, and log- 
normal shadow fading. The assumption of independent fast 
fading is sufficiently realistic for systems where the antennas 
are spaced sufficiently far apart 0. 

During the second phase, R re-transmits a transformation 
of the received signal given by y R = Wy R . The signal at Dk 
can be expressed as 

K 

VD k = \fP\g\ h Wg lh x k + y/P t 92 k W 9u x i 



g\Wn R + n Dk , 



(2) 



where g x , is the i-th column of G\, g 2k is the fc-th column 
of G2, W is the N x N transformation matrix normal- 



ized to satisfy a total power constraint, P r , at the relay as 
Tr(E^y R y R j^j = P r , and n Dk is the AWGN at D k with 

E{n Dk rJ Dk } = a\. 

As a result, the instantaneous end-to-end (e2e) signal-to- 
interference-noise ratio (SINR) at Dk can be written as 



7fc = 



Pt\9tWg lk 



K 



(3) 
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A. MRC/MRT at the Relay 

When CSI is available at R, it is natural to apply a 
transformation based on the MRC/MRT principl^H In the first 
phase, the relay uses MRC to combine the signals transmitted 
from K sources and then in the second phase, it uses MRT 
precoding to forward data to K destinations. Hence the relay 
transformation matrix is given by W = a mrc G2G\. In this 
case, to meet the power constraint at the relay, we have 



P r 



Tr I P t ( G\G t ) G\G 2 + alG\G x G\G2 



(4) 



From ©, the received signal at D k for MRC/MRT at the 
relay is given by 

K 

UD k = a mrc ^J~P t g\ k G 2 G\g lk x k + a mrc ^J~P t ^ g\ k G 2 G\g x . x t 

i—l^i^k 

+ a m rcg\ k G2G\n R + n Dk . 
Hence, the e2e SINR can be expressed as 



(5) 
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B. ZF at the Relay 

We now consider the use of ZF receivers and precoders at 
the relay. With ZF processing, the transformation matrix can be 

expressed as W = a z fG2 (gI>G2^ ^G{G?iJ G\, where 
a z f is chosen to satisfy the power constraint at the relajQ, i.e., 



P 



x Tr (Pt(<^G 2 ) 



crl (G\G 2 



g\g, 



(7) 



'Note that the choice of W based on the MRC/MRT principle is not 
optimal for maximizing the SINR. However, finding the optimal W in an 
analytical form seems impossible due to the non-convex nature of the problem. 
Furthermore, with very large antennas arrays, the channel vectors are nearly 
orthogonal, and hence MRC/MRT is nearly optimal (2). 

2 Reference 1131 Sec. IV] also considers ZF at the relay and employs a fixed 
gain for long term power normalization. In contrast, we consider a variable 
gain in (7). Since for receive filtering/transmit ZF precoding at the relay, 
instantaneous channel information must be used and a fixed gain can result 
in high peak-to-average power ratio signals, the gain in Q is a good practical 
choice when implementing ZF at the relay. 



For this case, we have 

9 2k w 9u = a z fS ki , (8) 

where 6 k i = 1 when k = i and otherwise. Therefore, from 
d2J, we can write the received signal at D k as 

-l 



UD k = a zf y/P t x k + a zf 



n R + n D , 



(9) 



where [A] k is the fc-th row of the matrix A. 

Now we can express the e2e signal-to-noise ratio (SNR) as 

4* 



7fc 



(10) 



A-A- 



C. Orthogonal Scheme 

For comparison with MRC/MRT and ZF, we also consider 
a "naive scheme" that employs orthogonal channel access. 
Specifically, to completely avoid the inter-user interference, 
each Sk~Dk pair for k = 1, . . . , K uses ^ channel resources 
for communication. At the relay, MRC/MRT is employed as it 
maximizes the e2e SNKH Therefore, the e2e SNR at D k can 
be expressed as 



Pi\\9\A\ 2 p,\ 



3-2, 
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III. Large N Analysis 



In this section, we further simplify the e2e SINR expressions 
© and ([Tol l in the very large N regime. These new expressions 
illuminate several aspects of the achievable rate vs. power 
efficiency performance in the considered network. Here we 
assume that N ^> K. We further assume that when N 
is large, the elements of the channel matrices G\ and G2 
are still independent. Note that even with very large N, the 
physical size of the antenna array can be small. For example, 
at 2.6 GHz, a cylindrical array with 128 antennas and A/2 
antenna spacing occupies only a physical size of 28 cm x 29 
cm and even with this array, the antennas experience nearly 
independent fading f3), lfl4l . 

The achievable ergodic sum rate of the system is given byQ 

Csum = E {E^ l0 8'2 (1 + 7k) l . (12) 

where * = {mrc,zf, ns} refers to MRC/MRT, ZF and naive 
schemes and the pre-log factor i is due to the half-duplex 
relaying. For MRC/MRT and ZF: a\ = 1 and for the naive 
scheme af = K. 

3 In the remainder of the paper, "MRC/MRT" is used to refer to the scheme 
(cf. Section II-A) where K > 1 destinations are simultaneously served. 

4 The exact analysis of and Cf um for arbitrary N is not a mathe- 

matically tractable problem since the required probability density functions 
(p.d.f.s) of 7™ rc and 7^ do not readily permit mathematical manipulation. A 
closed-form expression C s n u s m can be derived, but not reported since our main 
focus in this paper is to analyze the impact of the very large array, where 



In the following analysis, we will consider three cases: 
namely, Case I) Fixed NP t ,N —> 00; Case II) Fixed 
NP r , N -> 00; Case III) Fixed NP t , Fixed NP r , N -> 00. 

A. MRC/MRT at the Relay 

Case I): If P t = %■ where E t is fixed, then from @ we 
have 



VD k 



N 
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(13) 



In the very large N regime, we apply the law of large numbers 
given by IIT31 



92 k 92i a.s 
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(14) 



and note that 

Na mrc = 



P 



Tr & (Old) G\G 2 + al^G\G x G\G2 



P 



n^oo y Jr (E t D\D 2 + 0ID1D2) ' 
Now re-expressing the first term in (TLTt as 
■g\G 2 G\g lk 

i=l 



(15) 



K f t 

j^ry^ 9 2k 



N V -L ^ J^ 2 

i=l 

Therefore, when tends to infinity, we have 
9 2k G 2 G\g lk a. s I 



N 



P r E t 

/ Tr (E t D\D 2 + (T%D 1D2) 



(16) 



Similarly, for i ^ k, re-expressing the second term in dlJt as 



g\G 2 G\g u 
a mr cVE t — — = Na n 



N 



N 2 



we obtain 



g\G 2 G\g u 
a\/E t — _ -> 0. 



N 2 

Note that the third term in ([T"3l can be written as 
g\ G 2 G\n R 



(17) 
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We apply the law of large numbers and the Lindeberg-Levy 
central limit theorem and obtairjfl 



9l k G 2 G\g u 



P 



N 



Tr (E t D{D 2 + a 2 n D l D 2 



-r]2knR, 



(18) 



Lindeberg-Levy central limit theorem: Let p and q be n X 1 vectors whose 
elements are i.i.d. random variables with zero mean and variances of Up and 

<jq, respectively. Then -^p H q^CAf (0, cr^a^) , as n — > 00. 



where n R ~ CAf (0,r/ lk a^). Substituting $16$, $Tj} and < fl~8T > 
into (13[ and since -j=nD k — > we have 




P r £ t 



Tr [E t DfD 2 + alD 1 D 2 



-r]ikr]2kXk 



Pr 



Tr [E t D(D 2 + alDyDi 



-i]2kn>R. 



Now from (fT9b we obtain 

EtTjlk 



as TV 



oo. 



(19) 



(20) 



For K = 1, it is clear that we can reduce the transmit power 
by a factor of 1/N with no reduction in performance due to 
the array gain. But here we consider multiple sources, and 
the above result implies that by using a large number of relay 
antennas, we can still obtain the same array gain as in the case 
of single source. Interestingly, from (|20j we can see that when 
TV grows large and NP t is fixed, the e2e SNR does not depend 
on the transmit power at the relay and the large-scale fading 
of the second hop. This is due to the fact that, with MRT 
precoding at the relay in the second phase and P r is fixed, 
as TV goes to infinity, the effect of inter-user interference and 
noise at Dk disappears. Finally, the sum rate follows directly 
by substituting into (fT2] i. 

Case II): If P r = jf- where E r is fixed, we observe that 



(21) 



Therefore when TV grows without bound, the first term in (0 
tends to 



P t gl k G 2 G\g lk 



"v^fey™- (22) 



Similarly, when TV goes to infinity, the second and third terms 
in (0 converge almost sure to 0. Therefore, 



VD k i / , 2 , VlkV2kXk +n Dk , as TV -> oo. (23) 
V Tr \ D i D V 



From fl23l , as TV — s- oo, we have 



Ik 



Tr(D{D 2 )al 



(24) 



The above result shows that the transmit power at relay 
can be made inversely proportional to the number of relay 
antennas without compromising the quality-of-service. Note 
that in the special case where rjik = r\\ and i] 2 k = V? f° r 
k = 1, . . . , K, we have 7jJ lrc — > -j? -^r- which does not depend 
on the transmit power of each source and the channel quality 
of the first hop. 



Case III): If P t = § and P r = § where E t and E r are 
fixed, using a similar approach as above we can show that 



VD k 



E t E r 



-r\ 2 knR + n Dk . (25) 



E r 



Therefore, 



Ik 



Tr {E t D\D 2 + o*D 1 D 2 



EtVi 



Tr(B t .Dj-D 2 +a-2.D 1 £> 2 ) 



as N — > oo. 



(26) 
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Interestingly, by using large antenna arrays, we can scale 
down both transmit powers of source and relay nodes by a 
factor of 1/N with no reduction in performance. Furthermore, 
we can see that when E t — > oo, the above SINR coincides 
with the result for the case P r is fixed, and when E r — > oo, 
the above SINR coincides with the result for the case P t is 
fixed. 

In the special case where rjik = rji and i} 2 k = 779 for k = 
1 , . . . , K; we have 7™ rc — > ^f- 

B. ZF at the Relay 

By following a similar derivation as in the case of 
MRC/MRT, we can obtain the same power scaling law as 
follows. 

Case I): In the very large N regime we first note that 



Pr 



^Tr (E t D^ 1 +ai(D 1 D 2 )- 1 
Then from <j9j we have 
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\ Tr (E t D^ + al (D 1 D 2 y 1 ) rj* k ^ 



(28) 



where ur ~ CAf (0, rjikcr^). Hence, when N grows without 
bound, we obtain 



zf aji. ExJ]\k 



Ik 



Case II): In this case for very large TV, a z f tends to 



a z f 



Er 



i Tr {P t D 2 
Therefore, we can write ((9) as 



(29) 



(30) 



VD k 



E r 



Tr (P t D 2 



PtXk + riD k , as — > 00. (31) 



which leads to 



E r 



, as N — >• oo. 



(32) 



In the special case when r]2k = 1)2 for k = 1 , . . . , K, we 
have 7fe ^i^. 

Case ///J: If P t = §f and P r = |f where E t and are 
fixed, when N tends to infinity, we obtain 



(Zzf a . s . 



E r 



\ Tr (l^ 1 + o* {D^y 1 



Therefore, 



E t E r 



\ Tr ^D^ + a^DxiJa)- 



(33) 



(34) 



^ Tr [E t D^ + CT 2 (D^)- 1 J 77? 



where ~ C7V (0, iji^a^j. Hence, when N goes to infinity, 
we obtain 



Tr(E t D^ 1 +al(D 1 D 2 )- 1 ) m 
E, 



(35) 



In the special case where ri\k = Vi an d Vzk — ? ?2 for k 



1 , . . . , K , we have 7^ 



Remark 1: In Case I) the asymptotic rate (N — > 00) of 
with MRC/MRT and ZF processing is same and given by 
CT C = C? = \ log 2 (1 j- Sai 



Remark 2: In Case //) the asymptotic rates at with 



MRC/MRT and ZF, C^ rc | Cf , is determined by 

A" 



1 K K 1 



(36) 



Remark 3: In Case III) the asymptotic rates at the fcth des- 
tine 
by 



tination with MRC/MRT and ZF, Cf K | Cf , is determined 
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i^k 
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-S-l^rturm 1 + — 2- ) < Z^^— ' 1 



i=£k 



(37) 

C. Orthogonal Scheme 

We now consider the asymptotic sum rate of the orthogonal 



scheme. In Case I): It is easy to show that 7 £ s 



and 



C s n u s m = 57? 1 log 2 1 + ^ • In Case II): 7 £ s h- ^ 
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Fig. 2. Case I): Sum rate vs. the number of relay antennas. E t = 10 dB, 
P r = 1 and K = 5 users are served. D± = D2 = Is. 
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Fig. 3. Case II): Sum rate vs. the number of relay antennas. P t = 1, E r = 10 
dB and K = 5 users are served. Di = D2 = Is- 



IV. Numerical Results 

The sum rates achieved by the multi-pair relay system are 
evaluated through simulations and compared with our asymp- 
totic analytical results. Without loss of generality, of t = 1 is 
assumed. 

Fig. 2 shows the simulated sum rate vs. the number of relay 
antennas and the presented analytical asymptotic results for 
Case I). Clearly, as the number of antennas increases, the sum 
rates of MRC/MRC, ZF and the naive schemes approaches 
the corresponding constant values predicted by our analysis. 
Interestingly, the sum rate curve of ZF has a sharper knee than 
the MRC/MRT counterpart on the way to the same asymptotic 
constant and the achieved sum rate is 8.65 bits/s/Hz. Though it 
is not explicitly seen in Fig. 2, for small number of antennas 
(N < 6), the naive scheme exhibits a better sum rate than 
MRC/MRT as expected. However, as N increases, the sum 
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Fig. 4. Case III): Sum rate vs. the number of relay antennas. P t 
E r = 10 dB and K = 5 users are served. D\ = D2 = ^5. 
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Case II): Per user rate vs. the number of relay antennas. K = 3 are 



rate offered by the naive scheme rapidly saturates while 
the sum rates of the MRC/MRT and ZF schemes show a 
rapid improvement. This is because with a limited number of 
antennas, interference cannot be significantly reduced and thus 
lowers the sum rate of MRC/MRT. But when N grows large, 
the random channel vectors between sources/destinations and 
relay become pairwise orthogonal and hence, the interference 
is canceled out. At the same time, we gain from simultaneously 
serving K source-destination pairs in the same time-frequency 
resource. 

Figs. 3 and 4 show results for the second and third power 
scaling laws; Case II) and Case III). Both MRC/MRT and 
ZF achieves the same sum rate of 4.73 and 3.36 bits/s/Hz in 
Case II) and Case III), respectively. Moreover, similar trends 
in results as in Fig. 2 can be observed. 

The rates achieved by individual destinations are illustrated 
in Fig. 5 for slow-fading coefficients; 7]n = 2, 7712 = 2, 7713 = 
2 and 7721 = 1, V22 = 3, 7/23 = 3 and Case II). Results in Fig. 
5 confirm that the sum rate of MRC/MRT can be higher than 
the sum rate of ZF depending on the slow fading parameters. 
C™ = 8.98 and Cf um = 8.90. Recall that < Cf um in 

the example of Fig. 3. Interestingly, when N < 350, all three 
users in the ZF system achieve a higher rate than the users in 
the MRC system. However, when N is very large, two users 
of the MRC achieve a higher rate than the ZF users. 

V. Conclusion 

We have shown that relay systems can benefit significantly 
from the use of very large antenna arrays. The offered sum 
rates of a multi-pair relay system was investigated for three 
different power scaling laws. At the relay, MRC/MRT and ZF 
processing was considered. We derived asymptotic sum rate re- 
sults and confirmed their accuracy using computer simulations. 
Several insights were extracted using the analysis to illuminate 
the comparative performances between MRC/MRT and ZF. 
For example, the asymptotic achievable rates of MRC/MRT 
and ZF are the same if we scale the power at the sources. 
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